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Abstract — In duty-cycled wireless sensor networks, the nodes 
switch between active and dormant states, and each node 
may determine its active/dormant schedule independently. This 
complicates the Minimum-Energy Multicasting (MEM) problem, 
which has been primarily studied in always-active wireless ad- 
hoc networks. In this paper, we study the duty-cycle-aware MEM 
problem in wireless sensor networks, and we present a formu- 
lation of the Minimum-Energy Multicast Tree Construction and 
Scheduling (MEMTCS) problem. We prove that the MEMTCS 
problem is NP-hard, and it is unlikely to have an approximation 
algorithm with a performance ratio of (1 o(l))lnA, where 
A is the maximum node degree in a network. We propose 
a polynomial-time approximation algorithm for the MEMTCS 
problem with a performance ratio of 0(H(A + 1)), where 
H(-) is the harmonic number. We also provide a distributed 
implementation of our algorithm. Finally, we perform extensive 
simulations and the results demonstrate that our algorithm 
significantly outperform other known algorithms in terms of both 
the total energy cost and the transmission redundancy. 

I. Introduction 

Wireless Sensor Networks (WSNs) are decentralized wire- 
less networks without any preexisting infrastructures, and 
the sensor nodes are usually powered by batteries. Since 
the limited battery lifetime imposes a severe constraint on 
the network performance, it is imperative to develop energy 
conservation mechanisms for WSNs. One common approach 
for energy conservation in WSNs is duty-cycling, in which 
each node switches between active and dormant states, and the 
active/dormant schedule can differ from node to node (TJ-Q- 
Duty-cycling can be easily implemented, and is proved to be 
an effective way for energy conservation [2|. Consequently, 
Duty-Cycled Wireless Sensor Networks (DC-WSNs) have 
often been adopted by various applications [8]-fT0]|. 

Multicasting is a crucial component of wireless networking, 
and it has been applied to WSNs in supporting data dissemi- 
nation for distributed data management (e.g., flT)). Therefore, 
designing an energy efficient multicast protocol is of great 
importance. In an Always-Active Wireless Ad-hoc NETwork 
(AA-WANET), the network topology is static, and each for- 
warding node can cover all its neighboring nodes by only one 
transmission. Therefore, the main task of the Minimum-Energy 
Multicasting (MEM) problem in AA-WANETs is to select 
appropriate forwarding nodes such that a multicast tree with 
the minimum energy cost can be constructed. This problem 



was proved to be NP-hard, and some approximation algorithms 
have been proposed p2|-| 16 1. 

In DC-WSNs, however, new challenges arise. More specif- 
ically, the network topology is now intermittently connected, 
and a forwarding node may transmit the same data packet 
many times to reach its neighboring nodes. Therefore, to 
design energy efficient multicasting algorithms in DC-WSNs, 
not only the forwarding nodes should be selected appropriately 
to construct a multicast tree, but also the transmissions of each 
forwarding node need to be scheduled intelligently to cover 
the receiver nodes and reduce the transmission redundancy. 
These two problems must be handled holistically so that the 
total energy cost can be reduced. Thus, the existing solutions 
for the MEM problem in AA-WANETs are not suitable 
for DC-WSNs, and we need to design new energy efficient 
multicasting algorithms to meet the challenges in DC-WSNs. 

A. Background and Motivations 

The MEM problem in wireless ad-hoc networks has been 
studied in [12|-[16|. Wieselthier et al. (T2| studied the min- 
imum power broadcast/multicast routing problems under a 
scenario where each node can adjust its transmission power 
continuously, and proposed several greedy heuristics. Wan et 
al. fl3) proved that the heuristics proposed by |12| have lin- 
ear approximation ratios, and provided several approximation 
algorithms with constant approximation ratios for the MEM 
problem based on the Minimum Steiner Tree algorithm. Liang 
JT4l considered a scenario in which each wireless node can 
adjust its transmission power in a discrete fashion, and the 
communication links are symmetric. He proposed a centralized 
approximation algorithm with performance ratio 41n|Af| for 
building a minimum-energy multicasting tree, where M is the 
set of destination nodes in a multicast request. Li et al. | p3| 
considered a case in which all nodes have a fixed transmission 
power and the communication links are asymmetric. They 
converted the minimum-energy multicasting problem to an 
instance of the Directed Steiner Tree (DST) problem [17|, 
and presented several heuristics. Liang et al. fT6[ studied the 
minimum-energy all-to-all multicasting problem in wireless 
ad-hoc networks, which aims at building a shared multicast 
tree such that the total energy consumption of performing an 
all-to-all multicast session by this tree is minimized. They 



proved the NP-completeness of this problem and also proposed 
several approximation algorithms. However, all the algorithms 
proposed in fT2")-fl6| took the assumption that the network 
nodes are always-active. Therefore, they are not directly 
applicable to DC-WSNs. 

Recently, the data dissemination problems in DC-WSNs 
have started to attract attentions from the research community. 
Wang and Liu [3] tackled the broadcast scheduling problem in 
DC-WSNs based on the dynamic -programming approach. Guo 
et al. |(4j considered the effect of unreliable links on broad- 
casting, and proposed an opportunistic forwarding scheme to 
reduce the broadcast delay and redundancy in DC-WSNs. 
Hong et al. |6) studied the Minimum-Transmission Broadcast- 
ing (MTB) problem in DC-WSNs. They adopted a restricted 
duty-cycling model, where only one active time slot exists in 
a working period of each node. They proposed a centralized 
algorithm with approximation ratio of 3 (In A + 1) and a 
distributed algorithm with approximation ratio of 20. Note 
that under their particular duty-cycling model, any node can 
determine its optimal transmission schedule in polynomial- 
time for transmitting a data packet to a set of neighboring 
nodes. Therefore, their methods cannot be adapted to our 
case. Xiong et al. [7] adopted the same duty-cycling model 
as that in |6j, and studied the load balancing problem for data 
dissemination in DC-WSNs. They proposed several scheduling 
algorithms for data transmissions such that the maximum 
workload of the forwarding nodes is minimized. However, 
they assumed that the routing paths are pre-computed, and 
their concern is load balancing instead of energy-efficiency. 

To the best of our knowledge, the only work which studied 
the MEM problem in DC-WSNs is J5). In (5), the authors 
adopted another restricted duty-cycling model in which 
the active time slots of any node must be consecutive, and 
proposed two optimal algorithms ("oCast" and "DB-oCast") 
for the MEM problem in DC-WSNs. As the energy cost for 
receiving data was neglected in (5), the algorithms proposed 
in (5) actually minimize the transmission redundancy in a 
multicast session. Most importantly, although oCast and DB- 
oCast were both claimed to be optimal in J5), their time 
complexity grows exponentially with respect to the number 
of destination nodes. 

B. Our Contributions 

In this paper, we study the MEM problem in DC-WSNs 
using a generic duty-cycling model, where each wireless 
node determines its active/dormant schedule without any 
constraints. We formulate the MEM problem for DC-WSNs 
and prove its NP-hardness. We propose an approximation 
algorithm with guaranteed performance ratio, as well as a 
distributed implementation of our algorithm. The contributions 
of our work can be summarized as follows: 

1) We formulate the Minimum-Energy Multicast Tree Con- 
struction and Scheduling (MEMTCS) problem and prove 
its NP-hardness. We also prove that, unless NP C 
DTIME(n 0( - lo s lo s^)) 7 the MEMTCS problem cannot 
be approximated with a performance ratio of (1 — 



o(l))lnA, where A is the maximum node degree of 
the network graph under consideration. 

2) We propose a polynomial-time approximation algorithm 
for the MEMTCS problem with an approximation ratio 
of 12pH(A + l) +4p, where H(-) is the harmonic num- 
ber and p is the approximation ratio of a given algorithm 
for the Minimum Steiner Tree (MST) problem. 

3) We present a distributed implementation of the proposed 
algorithm, and we conduct extensive simulations to 
evaluate the performance of our algorithm. The simula- 
tion results demonstrate that our algorithm significantly 
outperform other known algorithms in terms of both the 
total energy cost and the transmission redundancy. 

To the best of our knowledge, we are the first to present 
a polynomial-time approximation algorithm with provable 
approximation ratio for the MEM problem in DC-WSNs. 

The rest of our paper is organized as follows. In Section II, 
we introduce the wireless network model and formulate the 
MEMTCS problem. In Section III, we analyze the complexity 
of MEMTCS, and we propose an approximation algorithm 
for it. A distributed implementation of the proposed algorithm 
is presented in Section IV. In Section V, we evaluate the 
performance of the proposed algorithm by simulations. Section 
VI concludes the paper. In order to maintain fluency, we 
postpone the proofs to the appendix. 

II. Assumptions and Definitions 

In this section, we first describe our network model and 
related parameters, then we present the formulation of the 
MEMTCS problem that we tackle in this paper. 

A. Network Model and Parameters 

A wireless sensor network is modeled by an undirected 
graph G — (V, E), where V is the set of wireless nodes, 
and E is the set of links. The nodes in V are distributed in 
a two-dimensional plane and each node is equipped with an 
omni-directional antenna. We assume that all nodes have the 
same fixed transmission power, and there exists a link between 
two nodes if they are within the transmission range of each 
other. We also assume that each node has an unique ID and 
knows the IDs of its one-hop neighbors. 

We assume that time is divided into equal-length slots, 
and each time slot is long enough for sending or receiving 
a data packet. Without loss of generality, we assume that the 
working schedule of each node is periodic, and the working 
period of any node has K time slots. To save energy, every 
node switches between active and dormant states, and we 
denote by T(u) the set of active time slots in the working 
period of node u (T(u) C {1,2,..., K} and T(u) ^ 0) . 
The active/dormant schedule is independently determined by 
each node without any constraints. Note that our duty-cycling 
model is similar with the model used in [ 1 1, |4|, and the duty- 
cycling models used in |5)-|(7l can be considered as special 
cases of our model. Following a very common setting in DC- 
WSNs, we assume that a node can wake up its transceiver 
to transmit a packet at any time slot, but can only receive 



a packet when it is active. We also assume that the time 
synchronization is achieved in network, and each node knows 
the active/dormant schedule of its neighboring nodes. These 
are common assumptions in the literature (TJ, (3J-|(7j. 

We denote by e s the energy cost for sending a data packet 
by any node, and denote by e r the energy cost for receiving a 
data packet. It is well known that a wireless sensor node has 
different power consumption levels at different working states 
such as transmitting, receiving, and idle-listening. Let e t x, e rx , 
and en be the energy consumptions of radio for transmitting, 
receiving and idle-listening, respectively. Usually, e tx is larger 
than e rx , whereas en is only slightly smaller than e rx . Since 
a node has to consume at least en when it is active, we set 
e s to be the value of e tx — en, and set e r to be the value of 
p-rx — su. Clearly, e s > e r > 0. 

For the convenience of description, we clarify some other 
notations here. For any node u, we denote by nba(u) the 
set of neighboring nodes of u in G. Suppose that T is an 
arbitrary tree. We denote by N(T) the set of nodes in T. 
Denote by E(T) the set of edges in T. Denote by d}{T) the 
set of nodes in T with degree one. Denote by d + (T) the set 
of nodes in T with degree greater than one. If T is a rooted 
tree, then we denote by nl(T) the set of non-leaf nodes in T, 
and denote by child(u, T) the set of child nodes of node u in 
T. Suppose that r is the root node of T. Clearly, if r G d l {T), 
then n/ (T) = d+(T) U {r}; otherwise nl(T) = d+{T). 

B. The MEMTCS Problem 

In a multicast session, there exists a terminal set M C V 
and a source node s E M, such that the data sent by s should 
be received by all the nodes in M — {s}. A multicast tree T is 
a sub-tree of G which is rooted at s, and each terminal node 
in M is a tree node of T. 

Since all nodes have the same transmission power, the 
energy cost for sending a data packet using a multicast tree 
T can be determined by the number of total transmissions 
of the nodes in nl(T). In DC-WSNs, the time slot at which a 
node transmits the packet decides which neighbors can receive 
it. Therefore, the transmission schedule of each forwarding 
node plays a key role on the energy cost for multicasting. Fur- 
thermore, incorrect schedules can even prevent the destination 
nodes from receiving a data packet. 

For example, consider the wireless network shown in 
Fig. [TJa). Suppose that the terminal set is {ni, ne, 717, Tig}, 
and node n% is the source node. The set of numbers associated 
with each node indicates the active time slots in the working 
period of that node. Fig. |TJb) and (c) are two multicast trees 
T\ and T 2 , respectively. If node n 2 in T\ transmits at time 
slot 3 or 4, then only one child node (ng or U4) can receive 
the data. Therefore, a correct transmission schedule of node 
ri2 for T\ must be a set of time slots which contains {3,4}. 

From this observation, we introduce the concept of "feasible 
schedule". Actually, finding a feasible schedule of any non- 
leaf node in a multicast tree equals to finding a "hitting set" 
ijTSj . We clarify this by Definition [TJ and Definition [2] 




Fig. 1. A DC-WSN graph G (a) and two multicast trees Ti(b) and T 2 (c). 

Definition 1 {Hitting Set [181): Given a collection C of 
subsets of a finite set T, a hitting set is a subset J' C J 
such that T' contains at least one element from each subset 
in C. 

Definition 2 (Feasible Schedule): Given a multicast tree T 
in G, a function B : nl(T) — > 2^ 1 ' 2 '— K ^ is called a feasible 
schedule for T if and only if for any u £ nl(T), B(u) is a 
hitting set of the collection {r(u)|u G child(u,T)}. 

According to Definition [2] the energy cost for sending a 
data packet using a feasible schedule B on multicast tree T 
can be written as >~2ueni(T) ' e s- This is only part of 

the total energy cost in a multicast session, because the energy 
consumption for receiving data also needs to be taken into 
account. In our case, since each forwarding node u in nl(T) 
knows the duty-cycling schedules of all its neighboring nodes, 
it can always transmit at the time slots in B(u) such that each 
node in child(u,T) receives the same data packet only once. 
Therefore, the energy cost for receiving a data packet using 
multicast tree T can be written as (|JV(T)| — 1) • e r . 

Based on the above discussions, we introduce the Minimum- 
Energy Multicasting Tree Construction and Scheduling 
(MEMTCS) problem in Definition [3] 

Definition 3 (MEMTCS): Given a DCWSN G, a terminal 
set M C V, and a source node s <E M, the MEMTCS problem 
seeks a 2-tuple (T opt , B opt ) in which T opt is a multicast tree 
rooted at s and B opt is a feasible schedule for T opt , such that 
the total energy cost 

n(T opt ,B opt ) = l^o P t(w)|-e s + (|iV(T opt )|-l)-e r 

uenl(T opt ) 

is minimized. 

For example, we can find two feasible schedules B\ and B2 
for T\ and T 2 in Fig. |TJ respectively, such that = {1}, 

Bi(na) - {3,4}, B^m) = {2,3}, and B 2 {n x ) = B 2 {n 3 ) = 
#2(715) = {3}, B2(n$) = {2}. Suppose that e s — 10 and 
e r = 2. We can get n(Ti,Bi) = 60 and n(T 2 ,B 2 ) = 50. 
Actually, (T 2 ,B 2 ) is the optimal solution for n\ to send data 
to {n 6 ,n 7 ,n 8 } in Fig. [I] 

III. Solving the MEMTCS Problem 

We first briefly evaluate the hardness of MEMTCS. We 
prove it is NP-hard by a reduction from the Minimum Hitting 
Set (MHS) problem |18|, and we claim this in Theorem 1: 

Theorem 1: The MEMTCS problem is NP-hard. 



The MHS problem was proved to be equivalent to the Min- 
imum Set Cover (MSC) problem |T8), Q9j. Moreover, Fiege 
[20 1 has proved that, unless NP has quasi-polynomial time 
algorithms, there does not exist a polynomial-time algorithm 
for the MSC problem with performance ratio of (1 — o(l))lnn, 
where n is the size of the MSC problem. Therefore, with the 
proof of Theorem [T] we can easily get: 

Corollary 1: Unless NP C DTIME (n ( loglos ")), there 
does not exist a polynomial-time approximation algorithm 
with performance ratio of (1 - o(l))lnA for the MEMTCS 
problem, where A is the maximum node degree of network 
graph under consideration. 

Next, we propose an approximation algorithm for the 
MEMTCS problem. We first provide a brief overview of 



our algorithm in Section III-A then describe our methods in 
details in Section PI-BUlII-EI 

A. An Overview of the Proposed Algorithm 

Our approximation algorithm consists of several steps. 
Firstly, we use a graph transformation method to transform 
the original network graph G into an extended graph G 
where the possible transmitting time slots of the nodes in 
G are represented as satellite nodes, and the nodes in G are 
connected in a particular way to facilitate the design of our 
approximation algorithm. Secondly, we propose the concept of 
Minimum Satellite Bridge (MSB) in G as well as an algorithm 
for finding an approximation MSB. The MSB is actually a 
special tree in G whose nodes can cover all the nodes in M. 
Finally, we map the approximation for MSB to a multicast 
tree in G and a feasible schedule for the multicast tree, which 
serves as an approximate solution to the MEMTCS problem. 

To find the approximation ratio of our algorithm, we propose 
another concept Minimum Isotropic Scattering Tree (MIST), 
which is a special multicast tree Tj in G spanning the nodes in 
M, We prove that Tj serves as a quantitative "bridge" between 
the number of nodes in a MSB and (T opt , B opt ). As a result, 
we obtain the approximation ratio of our algorithm. 

B. Graph Transformation 

The first step of our approach is to transform the original 
network graph into an extended graph. We introduce the 
concept of the extended graph in Definition [4] 

Definition 4 (Extended Graph): The extended graph of G 
is an undirected graph G = (V, E), where V is the set of 
nodes and E is the set of edges. The nodes and edges in G 
are created by the following steps: 

(i) Initially, V = V and E = 0; 

(ii) For each node u G V and each time slot i g 

\Jvenb G (u) T ( v )> create a new node A(tt,i) in V. The 
node X(u, i) is called a satellite node of u on slot i, 
and u is called a nuclear node of X(u, i). The set of all 
satellite nodes of u is denoted by *£(u); 

(iii) For each node u £ V, create an undirected edge between 
each pair of nodes in *&(u) U {it}. In other words, the 
sub-graph induced by ^>(u) U {u} is a complete graph; 



(iv) For each edge (u, v) £ E, each time slot i E T(v) and 
each time slot j € create three undirected edges 

(X(u,i),X(v,j)), (X(u,i),v) and (u,X(v,j)). 
From Definition [4J we can see that V can be partitioned 
into two disjoint subsets: V and Vs, where V is the set of all 
nuclear nodes, and Vs is the set of all satellite nodes. A nuclear 
node may have multiple satellite nodes, but any satellite node 
only has one nuclear node. An example of the extended graph 
is shown in Fig. 2. 




Fig. 2. A DC-WSN graph G (left) and its extended graph G (right). 

According to the construction rules in Definition |4] we can 
obtain some useful properties of G, as described in Lemma [7] 
and Lemma [2] Their proofs are omitted due to page limits. 
Lemma 1: There are at most (K + 1)\V\ nodes and 

k 1 ' ^ 'V\+3K 2 \E\ edges in G. 



Lemma 2: Any two nuclear nodes cannot be adjacent in G, 
and any satellite node in Vs can be adjacent to at most A + 1 
nuclear nodes in G. 

C. Minimum Satellite Bridge 

We first introduce the concept of Minimum Satellite Bridge 
(MSB) in Definition [5] and then we propose an approximate 
algorithm for finding a MSB. As we will see later, finding 
a MSB is an important building block of our algorithm for 
solving the MEMTCS problem. 

Definition 5 (Minimum Satellite Bridge): Given 
G = (V,E) and the terminal set M C V, a Satellite 
Bridge SB is a sub-tree of G which satisfies: 

1) The nodes in SB are all satellite nodes; 

2) Each node in the terminal set M is adjacent to at least 
one node in SB. 

The Minimum Satellite Bridge (MSB) SB* is an SB with a 
minimum number of nodes. 

Next, we propose an approximation algorithm with a perfor- 
mance ratio of O (H(A + 1)) for finding a MSB, as shown 
in Algorithm [T] This algorithm consists of two stages. The 
first stage is lines 1-6, and the second stage is line 7. In the 
first stage, we use a greedy set cover algorithm to find a small 
satellite node set G that can cover all the nodes in M. In each 
loop, we first find a node v which has the maximum number 
of adjacent nodes in the uncovered node set UC (line 3). Then 
we add v into G (line 4) and update U C (line 5). In the second 
stage (line 7), an approximate Steiner tree algorithm is applied 
upon G. 

Since each satellite node can cover at most A+l nodes in 
M (according to Lemma [5J, the first stage of Algorithm 1 has 



Algorithm 1 Finding an approximate MSB 

Input: The extended graph G and the terminal set M, 
Output: An approximate MSB SB. 

l: C = 0, UC = M; 
2: while UC ^ do 

3: v = argmaxng^-c \nb^{u) n UC\; 
4: C = CU{v}; 
5: E/C = UC - (nb d (v) n UC); 
6: end while 

7: Let G s be the sub-graph of G induced by Vs. Assign 
each edge in G s a weight of 1. Compute an approximate 
minimum Steiner tree ST in G s which connects the nodes 
inC; 

8: return SB = ST. 



an approximation ratio of H(A + 1) |21|. The approximation 
ratio of the second stage is a constant p. By taking advantage 
of the special structure of the extended graph, we can get: 

Theorem 2: The approximation ratio of Algorithm [T] is 
3pH(A + 1) + p, where H(-) is the harmonic number and p 
is the approximation ratio of the algorithm used in Algorithm 
1 for finding a minimum Steiner tree. 

D. Minimum Isotropic Scattering Tree 

Now we show that an MSB can be mapped to a special 
multicast tree in G called the Minimum Isotropic Scattering 
Tree (MIST) and a feasible schedule for the internal nodes 
in the MIST. Although MSB does not produce an optimal 
solution to the MEMTCS problem, we show that it leads 
to an approximation for MEMTCS, and we use MIST as a 
medium to derive the approximation ratio. We first introduce 
the concept of MIST in Definition |6j 

Definition 6 (Minimum Isotropic Scattering Tree): For 
any tree T in G and any node u £ d + (T), we denote 
by T(u, T) a minimum hitting set of the collection 
{T(v)\v G nb T (u)}, and define S(T) = E ued+(T) \T(u,T)\. 
The Minimum Isotropic Scattering Tree (MIST) Tj is a tree 
in G such that Tj spans M and that 2(Tj) is minimized. 

We claim that B(Tj) actually equals to the number of nodes 
in a minimum satellite bridge, which is proved by Lemma [3] 
Lemma |4] and Theorem [3] Lemma [3] claims that the number 
of nodes in a MSB is no more than S(Tj), whereas Lemma [4] 
implies that S(Tj) is no more than the number of nodes in 
a MSB. With these two lemmas, Theorem 3 can be readily 
proved. 

Lemma 3: \N(SB*)\ < B(2». 

Lemma 4: Any satellite bridge SB can be mapped to a 2- 
tuple (R,F), where R is a tree in G spanning the nodes in 
M, and F is a function that satisfies: 

1) For any u € d + (R),F(u) is a hitting set of the 
collection {r(u)|i! S nbn(u)}; 

2) J2ue d+(R) 

\F(u)\ < \N(SB)\. 
Theorem 3: H(Tj-) = \N(SB*)\ 

The proofs take advantage of the special structure of the 
extended graph. In addition, the proof of Lemma [4] actually 



provides a method of mapping any satellite bridge to a 
multicast tree and a feasible schedule for the internal nodes in 
the multicast tree. This mapping can be roughly described as 
follows. According to the construction rules of the extended 
graph, the satellite nodes in a satellite bridge can be mapped 
to the nuclear nodes that they belong to, as well as the 
transmitting time slots on these nuclear nodes. Furthermore, 
we can find a multicast tree spanning these nuclear nodes and 
the destination nodes in M, and the internal nodes in the 
multicast tree are all the mapped nuclear nodes. According 
to the special node-connecting method of the extended graph, 
the mapped transmitting time slots of any internal node in the 
multicast tree can cover all its neighboring nodes in the tree. 

E. Approximation Algorithm for MEMTCS 

Based on the methods introduced by the previous sections, 
we propose our algorithm for the MEMTCS problem, as 
shown in Algorithm [2] 

Algorithm 2 Approximation for MEMTCS 

Input: A DC-WSN G, a terminal set M, and a source node 

se M. _ _ 

Output: A multicast tree T and a feasible schedule B. 
1: Construct the extended graph G — (V, E) of G; 
2: Use Algorithm |T]j:o compute an approximate minimum 

satellite bridge SB; 
3: Use the method in the proof of Lemma 4 to map SB to a 

2-tuple (T, F). Let T be the rooted tree got by designating 

s as the root of T; 
4: for each node u € nl(T) do 
5: if u e d+(T) then 
6: B{u) = F{u); 

7: else 

8: Let v be u's child node in T. Find an arbitrary 

9: i e T(v) and let B(u) = {i}; 

it): end if 
li: end for 
12: return (T,B). 

The output of Algorithm's a 2-tuple (T, B). From Lemma 
4, It is easy to know that T is a multicast tree spanning the 
nodes in M and B is a feasible schedule for T. Next, we prove 
the approximation ratio of Algorithm[2]by Lemma|5] Lemma|6] 
and Theorem [4] Lemma [5] is actually based on a special 
property of the MHS problem, i.e, if we add a subset in an 
instance of the MHS problem, then the cardinality of the result 
minimum hitting set will increase at most 1 . Using Lemma [5] 
Lemma [6] finds out a quantitative relationship between H(T/) 
and (T op t, B opt ), which is used in the proof of Theorem [4] 

Lemma 5: For any node u E d + (T opt ), we have 

|T(u,T opt )| < |B opt (u)| + 1. 
Lemma 6: (T opt , B opt ) is related to Tj by 

Euem(T jBo P t(u)\ > E(T T ) - |d+(T opt )| + 1. 



Theorem 4: The approximation ratio of Algorithm [2] is 
12pH(A + 1) + 4p. 

The dominating running time of Algorithm |2] is the time on 
constructing SB in line 2, using Algorithm Uj Lines 1-6 in 



Algorithm [lj can be implemented in 0(|V| ) time. If we use 
the 2-approximation algorithm proposed in [22 1 to compute an 
approximate minimum Steiner tree in line 7, the resulting time 
complexity is 0(| log |V^| + \E\). Given K as a predefined 
constant, the time complexity of Algorithm's 0(|I / | 2 + |£'|), 
and the approximation ratio is 2477 (A + 1) + 8. 

IV. Distributed Implementation 

In this section, we provide a distributed implementation of 
Algorithm [2] for the MEMTCS problem. Note that the main 
operation of Algorithm [2] is line 2, in which Algorithm [T] is 
called to compute an approximate minimum satellite bridge. 
Therefore, we first propose the distributed implementation of 
Algorithm [T] 

As we have described in Section [Tll-C| Algorithm[T]consists 
of two stages: the first stage is lines 1-6, in which a greedy 
strategy is used to find a small satellite node set C covering 
the nodes in M, and the second stage is line 7, in which an 
approximate minimum Steiner tree is computed. The first stage 
of Algorithm [T] can be decentralized in a way similar with the 
distributed dominating set algorithm in J23) , which is shown 
in Algorithm [3] 

Algorithm 3 Distributed implementation of the first stage of 
Algorithm [T] 

1: Each white node u with non-empty rnb(u) broadcasts a 

message ("election", \rnb(u)\, u.ID); 
2: Each red node checks all the "election" messages it 

receives, and finds a node v whose value of \rnb(v)\ is 

the maximum (break tie by choosing the node with largest 

ID). Then it sends v a message "you win"; 
3: If a white node u receives "you win" messages from all 

nodes in rnb(u), then it colors itself blue, and broadcasts 

a message "I am dominator"; 
4: If a red node receives an "I am dominator" message, then 

it colors itself green, and broadcasts a message "I am 

dominated"; 

5: If a white node u receives a "I am dominated" message 
from a neighboring node w, then it deletes w from rnb{u). 

In Algorithm [3] each node in G is colored red, green, white, 
or blue. The red nodes are the nodes in M which are not 
covered yet. The green nodes are the nodes in M which are 
already covered by some blue nodes. The blue nodes are the 
nodes which are selected into the resulting node set G, and 
the white nodes are the nodes which are not selected. Initially, 
each node in M is colored red, and all other nodes are colored 
white. Besides, each white node u owns a set rnb(u), which 
is initialized to be the set of IDs of all it's red neighbors. 

It can be seen that Algorithm [3] is a faithful implementation 
of the greedy strategy in lines 1-6 of Algorithm [T[ so the 



approximation ratio of Algorith m [3| is H(A + 1). There are at 
most \M\ rounds in Algorithm |3| because it repeats until no 
red nodes exist, and at least one red node turns green in each 
round. Therefore, the message complexity of Algorithm [3] is 
0(\M\.\V\) = 0(\M\.\V\). 

The second stage of Algorithm [T] can be decentralized by 
using a distributed Steiner tree algorithm in the literature [24 1- 



|26|. If we adopt the 2-approximation distributed algorithm 
proposed in (24J, then the message complexity is C(|M| ■ 
\V\) and the time complexity is 0(|Af| ■ D), where D is the 
diameter of G. 

The distributed implementation of Algorithm [2] (except line 
2) is trivial: an arbitrary spanning tree of the sub-graph induced 
by N(SB) needs to be found in line 3. To accomplish this, 
the distributed Depth-First Search (DFS) algorithm proposed 
by Makki et al. (27) can be applied. The time complexity and 
message complexity of the distributed DFS algorithm are both 
0{\V\). 

Based on these discussions, we can get Corollary 2: 
Corollary 2: There exists a distributed algorithm for 
MEMTCS. It has an approximation ratio of 24H (A + 1) + 8. 
The time complexity and message complexity of the dis- 
tributed algorithm are 0(D ■ \V\) and 0(\M\ ■ \V\), respec- 
tively, where D is the diameter of G. 

Though our distributed implementation for Algorithm [2] is 
based on the extended graph G, it can be easily adapted to run 
on G. According to the construction method of G, any satellite 
node in Vs can be seen as a local "pseudo node" administrated 
by its nuclear node in V. Therefore, each nuclear node can 
send messages for its satellite nodes, and do the computation 
that its satellite nodes need to do. Let A be the distributed 
algorithm running on G, and let A be the adapted version of 
A running on G. If several satellite nodes of the same nuclear 
node u send their messages simultaneously to other nuclear 
nodes in A, then u in A can send these messages at different 
time slots. Note that any nuclear node has at most K satellite 
nodes and if is a predefined constant. Therefore, if we define 
a constant 6(5 > K), then any nuclear node u in A can receive 
messages for all its satellite nodes after waiting <5 time, and 
then u can do the computation for its satellite nodes based 
on the messages it receives. The message passing processes 
between the satellite nodes administrated by the same nuclear 
node in A become local computations of that nuclear node in 
A, and no new messages are generated in A. Therefore, A 
has the same time and message complexity as A. 

V. Performance Evaluation 

In this section we evaluate the performance of our algorithm 
via simulations. The simulations focus on the effect of various 
network conditions on the performance of various multicasting 
algorithms. In the simulations, we deploy wireless nodes 
randomly in a 1000m x 1000m square, and the transmission 
range of each node is set to 300 meters. Each node randomly 
picks some time slots in the working period as its active time 
slots. Without loss of generality, the energy cost for sending 




and receiving a data packet by any node is set to 100 and 15, 
respectively. 

To the best of our knowledge, there is no polynomial-time 
minimum-energy multicasting algorithms designed for DC- 
WSNs under a generic duty-cycling model. So we have to 
compare our algorithms with several traditional multicasting 
algorithms, including the Shortest Path Tree (SPT) algorithm, 
the Approximate Minimum Steiner Tree (AMST) algorithm, 
and the minimal data overhead tree (the MNT algorithm) 
proposed by (28), (29). The SPT algorithm computes shortest 
paths from the source node to the receiver nodes, and aggre- 
gates these shortest paths to construct a multicast tree. The 
AMST algorithm computes an approximate minimum Steiner 
tree spanning all the nodes in the terminal set M. Here, we 
adopt the AMST algorithm provided by Kou et al. J30| , which 
was also used by Liang et al. [ fT6| to solve the minimum- 
energy all-to-all multicasting problem in always-active wire- 
less ad-hoc networks. The MNT algorithm was designed for 
reducing the multicast redundancy in static wireless ad-hoc 
networks. The work in [28 1 and [29 1 has proved that MNT 
can reduce the redundant transmissions in a multicast session 
more effectively than other heuristics Q 

To use the multicast trees constructed by the SPT, AMST 
and MNT algorithms in a DC-WSN environment, we need 
to find a transmission schedule for each forwarding node 
in the multicast trees. Obviously, the most energy-efficient 
transmission schedule for any forwarding node it in a mul- 
ticast tree T is the minimum hitting set of the collection 
{r(v)|i> £ child(u,T)}. However, finding a minimum hit- 
ting set is a NP-hard problem. Therefore, we use a greedy 
hitting set algorithm pT) to find the transmission sched- 
ules of the forwarding nodes. Since the greedy algorithm is 
essentially the best-possible polynomial-time approximation 
algorithm for the minimum hitting set problem (unless NP C 
DTIME(n°^°s l °s")))^ each forwarding node in the multicast 
trees generated by the SPT, AMST and MNT algorithms has 
the best-possible energy-efficient transmission schedule. 

In the simulations, we compare our algorithms with the 
traditional ones using two metrics including the total energy 

'We refrain from comparing with the algorithm proposed in [ 14| and oCast 
15], as they both have a much higher complexity than ours: C(|M| 2 • \V\ 2 ) 
for the former and exponential in \M\ for the latter. 



cost and the transmission redundancy. Note that minimizing 
the transmission redundancy is an important optimization 
objective both in multicasting and in broadcasting |3)-l6j. 
However, to the best of our knowledge, there is no other work 
that provides polynomial-time algorithms with guaranteed ap- 
proximation ratios for minimizing the transmission redundancy 
of multicasting/broadcasting in DC-WSNs under a generic 
duty-cycling model. 

In Fig. [3] we compare Algorithm [2] (denoted by TCS) with 
SPT, AMST and MNT. The length of working period is set to 
20, and the percentage of terminal nodes scales from 20% to 
100% with an increment of 10%. The algorithm proposed in 
pU) is adopted in Algorithm 2 for computing an approximate 
minimum Steiner tree. Fig. [3|a) and (b) plot the number of 
total transmissions in a multicast session. The number of 
network nodes is set to 100 and 300, respectively. We can 
see that MNT outperforms SPT and AMST greatly, because 
the multicast tree generated by MNT has less forwarding 
nodes (non-leaf nodes) than the other multicast trees |28|. 
We also see that TCS significantly outperform the other 
algorithms, and the transmission redundancy is reduced by 
about 20% as the percentage of terminal nodes approaches 
90%. The reason is that, since the traditional SPT, AMST 
and MNT algorithms generate multicast trees regardless of 
the duty cycles of the wireless nodes, they can't optimize the 
transmission schedules of the forwarding nodes in a global 
manner. Therefore, although the transmission schedules are 
optimized locally in the SPT, MST and MNT algorithms using 
a best-possible optimization algorithm (the greedy algorithm), 
their transmission redundancies are still high. On the contrary, 
the TCS algorithm builds the multicast tree and finds the 
transmission schedules of the forwarding nodes in a holistic 
manner by taking advantage of the special structure of the 
extended graph, so the transmission redundancy is reduced 
more effectively than the traditional algorithms. 

Fig. [3jc) and (d) plot the total energy costs of different 
multicasting algorithms. Again, the numbers of network nodes 
in Fig. 3(c) and (d) are set to 100 and 300, respectively. 
We can see that the TCS algorithm still outperforms the 
other algorithms. The explanation is that, compared with the 
other algorithms, the TCS algorithm significantly reduces the 
number of total transmissions, and generates a multicast tree 
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Fig. 4. Performance evaluation of various multicasting algorithms under 
scenarios with different lengths of the working period. The length of the 
working period scales from 5 to 40. The number of nodes in V is 150 and 
the number of nodes in M is 100. 



with moderate number of nodes. As a result, the total energy 
cost of the TCS algorithm is lower than the other algorithms 
when both the energy cost for sending data and the energy cost 
for receiving data are considered. We also notice that the TCS 
algorithm performs better when the percentage of destination 
nodes increases. This is because that more redundant trans- 
missions are reduced by TCS when the multicast tree grows. 

In Fig. [4] we study how the length of the working period 
impacts the performance of our algorithms. The number of 
network nodes is set to 150, and the number of terminal 
nodes is set to 100. The length of the working period scales 
from 5 to 40 with an increment of 5. Fig. |4^a) and (b) plot 
the number of total transmissions and the total energy cost 
for multicasting, respectively. It is evident that our algorithm 
still outperform the other algorithms when the length of the 
working period changes. Meanwhile, we also see that, for all 
algorithms compared in Fig. |4f a)-(b), both the transmission 
redundancy and the total energy cost do not vary much. An 
explanation is that, since the active time slots of any node are 
randomly selected from the working period, the number of 
common active time slots of any two different nodes does not 
vary much when the length of the working period increases. 
Therefore, the performance of all these algorithms are rather 
independent of the length of the working period under our 
generic duty-cycling model. 

VI. Conclusion 

In this paper, we have studied the Minimum-Energy Mul- 
ticasting (MEM) problem in Duty-Cycled Wireless Sensor 
Networks (DC-WSNs). We formalized the Minimum-Energy 
Multicasting Tree Construction and Scheduling (MEMTCS) 
problem, and proved its NP-hardness. A lower bound on the 
approximation ratio of any polynomial-time algorithm for the 
MEMTCS problem was given in our work. We presented an 
approximation algorithm with guaranteed approximation ratio 
for the MEMTCS problem, and proposed a distributed imple- 
mentation of our algorithm. The simulation results demonstrate 
that our algorithm outperform other related algorithms in terms 
of both the total energy cost and the transmission redundancy. 



Appendix 
Proofs for the MEMTCS problem 

Proof of Theorem 1: Given an instance (C, J-) of the 
minimum hitting set problem, we create a wireless network 
graph G by the following method: 

Let the elements in T be {/i, /2, f p }, and let the subsets 
in C be Ci, C2, C q . For each Cj(l < j < q), create a node 
Vj in G, and let T(vj) = < i < p) A (/, € Cj)}. Create 
a node x in G , and connect x to each Vj(l < j < q). 

Let x be the source node and {x} U {Vj\l < j < g}be the 
terminal set M in multicasting. Let e s = 1 and e r = 0. It is 
easy to prove that T has a hitting set for C of size at most k 
if and only if G has a multicast tree T and a feasible schedule 
B for T such that U(T, B) < k. Therefore, the MEMTCS 
problem is NP-hard. □ 
Proof of Theorem 2: Each node u € Vs may cover a 
set nbg(u) fl M, whose cardinality is bounded by A + 1 (see 
Lemma 2). Therefore, lines 1-6 of Algorithm [T] intrinsically 
represent a greedy algorithm for finding a minimum set cover 
pT[ . Suppose that C* is a smallest set of satellite nodes 
covering the nodes in M, we have: 

\C\<H(A + l)\C*\ (1) 

According to Definition [5] we know that the nodes in SB* 
also cover the nodes in M. Hence 

\C*\<\N(SB*)\ (2) 

Let ST* be a minimum Steiner tree in G s which connects the 
nodes in C. We have: 

\E(SB)\ < p\E(ST*)\ (3) 

For any node c G C — N(SB*), there must exist a node 
to e M adjacent to c. Since SB* is a satellite bridge, to must 
be adjacent to a certain node t in SB*. For convenience, we 
assume that c ^ \&(m) and t ^ ^(m) (otherwise the theorem 
can be proved in a similar way). According to Definition |4|iv), 
we know that there must exist two nodes c',t' 6 \&(m) such 
that c' is adjacent to c, and tf is adjacent to t. Since d and t' 
are both satellite nodes of to, if d 7^ tf, then d and t' must be 
adjacent, according to Definition |4|iii). Therefore, there exists 
a path from c to t in G s whose length is no more than 3, 
as shown in Fig. [5] In other words, there must exist a tree in 
G s whose node set contains N(SB*) U C, and it has at most 




Fig. 5. Connecting the node c in C to SB* 

\N(SB*)\ + 3\C\ - 1 edges. Since ST* is a minimum Steiner 
tree that connects the nodes in C, it follows that: 

\E{ST*)\<\N(SB*)\+3\C\-1 (4) 



Finally, with equation ([T])-(|4|, we can get: 

\N(SB)\ = \E(SB)\ + 1 < p\E(ST*)\ + l 

< p(|JV(SB*)|+3|C|-l) + l 

< p(|JV(SB*)| + 3.ff(A + l)|C*|-l) + l 

< (3pH(A + l)+p)\N(SB*)\-(p-l) 

< (3pH(A + l)+p)\N(SB*)\, 

hence the approximation ratio 3pH(A + 1) + p. □ 
Proof of Lemma 3: Let Si be the set {X(u,i)\u £ 
d+(T/) Ai € T(m,T/)}. Clearly, |S*i| = 3(Tj). Furthermore, 
we have the following statements: 

1) According to Definition |4|iii), we know that for any 
node u £ d + (Tj) and any i\,i2 £ T(u),ii ^ i 2 , the 
edge (X(u, X(u, 1%)) is in E. 

2) For any two neighboring nodes v\ and u 2 in d + (Tj), 
there must exit j\ £ Y(«i,7j) n r(v 2 ) and j 2 G 
Y(v 2 ,T/) n r(w x ). According to Definition Qiv), the 
edge (A(wi, ji), A(w2,j 2 )) is in £. 

3) For any node to 6 M, if m € d + (Tj), then for any 
i € T(m,Tj), X(m,i) is adjacent to to, according to 
Definition |4^iii). If m £ d 1 (T/), then there must exist 
a node mi € d + (Tj) adjacent to m. Since T(mi,Tj) 
is a minimum hitting set of the collection {r(t>)|t; £ 
nbTj(mi)}, there must exist k\ £ Y(toi, Tj) n T(to). 
Therefore, A(mi,fci) € Si. According to Definition 
^iv), we have (A(mi, fci), to) £ £7. 

From 1) and 2) it is easy to know that the sub-graph induced 
by S\ in G is connected. From 3) we know that every node 
in M is adjacent to a node in Si. Let SB\ be an arbitrary 
spanning tree of the sub-graph induced by Si. Then SB\ must 
be a satellite bridge. Since SB* is a minimum satellite bridge, 
we have \N{SB*)\ < |JV(SBi)| = |5i| = S(I». □ 
Proof of Lemma 4: Suppose that the nodes in SB belong 
to q nuclear nodes. Therefore, N(SB) can be partitioned into 
q mutually disjoint subsets: A±,A2,. . . ,A q , such that all the 
nodes in Ai have the same nuclear node at, and <Zj 7^ a,j for 
any i 7^ j,l < i,j < q. Let A be the set {a\, a 2 , a q }. 
According to Definition [5] for any node u € M — A, we can 
find a node u' £ N(SB) adjacent to u. So we can connect 
u to SB by adding the edge (u,u'). When all the nodes in 
M — A are connected to SB, we get a new tree SB'. 

Then we consider the graph G = (V',E r ) whose node 
set V is M U A and edge set E' is E[ U £J 2 , where E[ = 
{(ai,aj)\3(u,v) e SB' Au e Ai Av e Aj A(l < i ^ j < q)} 
and E' 2 = {(a l7 v)\3(u,v) £ SB' A u £ A t A v £ (M - A) A 
(1 < i < q)}. From Definition Qiv), it is easy to know that 
G' is a connected sub-graph of G, and all nodes in M — A 
are degree-one nodes in G' . 

Let R be an arbitrary spanning tree of G' . Clearly, we have 
M C N(R) and d+(R) C A. Let F(ai) = {l\3\(a h l) € 
for 1 < i < q. Clearly, |F(a,)| = |A;|. For en £ A and 
u> e nbji(ai), we have: 

1) If (ai, w) £ E[, then there must exist aj £ A , u £ Ai 
and v £ Aj such that w = aj ,(u, v) £ SB' and i 7^ j. 



According to Definition |4fiv), there must exist a time 
slot l\ £ r(aj) such that u = X(ai,l±). Note that l\ 
is also in F{ai). Therefore, F(a,i) n T(w) = F(a,) n 

r( Oj -)^0. 

2) If (ai,w) £ E'2, then there must exist u £ Ai adjacent 
to w. By reasoning similar to 1), we also have F(ai) fl 
T(w) ? . 

From 1) and 2), we know that any F(ai), 1 < i < q is 
a hitting set of the collection {r(v)|w £ ni>fl(a,*)}. Since 
C A, we have E„ ed+(fl ) \F(u)\ < T, u eA = 
Ekk, I^KOI = Ei<i< g 1^1 = □ 
Proof of Theorem 3: From Lemma 4 we know that 
we can find a tree i?* in G spanning the nodes in M and a 
function F* such that: 

1) For any u £ d + (R*), F*(u) is a hitting set of the 
collection {r(u)|v £ nbji*(u)}; 

2) Eue d+ (R*)\ F *( u )\<\ N ( SB *)l 

Clearly, we also have Y,ued+(R*) l T ( u ' ^*)l - 
E^ch.)!^^)!. and E(Tj) < E^^.) |T( U , 
Therefore, S(T Z ) < E„ ed +( fl .) I^(«)l < W(SB*)\. 
From Lemma 3 we know \N(SB*)\ < 3(2/). Therefore, 
H(2» = |JV(SB*)|. □ 

Proof of Lemma 5: From Definition [3] we know 
that S op t(w) must be a minimum hitting set of the col- 
lection {r(v)|t> £ child (u,T opt )}. If u is the root of 
T opt , child(u,T opt ) — nbT opt (u). Therefore, we have 
|T(w,T opt )| = \B opt (u)\ < \B opt (u)\ + 1. 

Otherwise, let u' be the parent node of u in T opt . Find an 
arbitrary j £ T(u'), then B opt (u) U {j} must be a hitting 
set of the collection {r(v)|t; £ nbT opt (u)}- So we still have 
\T(u,T opt )\ < \B opt (u) U {j}\ < \BZ pt (u)\ +1. □ 

Proof of Lemma 6: If s £ d 1 (r opt ), then |B opt (s)| = 1. 
With Lemma 5, we have: 

Y, \B op t(u)\ = \B opt (s)\+ Y, l^ptHI 

u£nl(T opt ) u£d+(T opt ) 

> 1+ Y (T(u,T oP t)-l) 
«erf+(T opt ) 

= S(T opt )-|rf+(T opt )| + l 

If s £ d+(T opt ), then |B op t(«)| = |T(s,T opt )|. Therefore, 
we still have: 

Y \B op t(u)\ = \B opt (s)\+ Y l^optWI 

uGnl(T opt ) «ed+(T opt )\{s} 

> |T(s,T opt )|+ Y d T K T o P t)|-l) 

«ed+(T opt )\{s} 

= £ |T(n,T opt )|-(|d + (T opt )|-l) 

ued+(T opt ) 

= H(T opt )-|rf+(T opt )|+l 

Now E„ e „ K Topt) l B opt(«)| > S(T/) - |d+(T opt )| + 1 
follows from the fact that S(T opt ) > S(Tr). □ 



Proof of Theorem 4: Using Lemma |6j we have: 

n(T op t, -B pt) 

53 \B op t(u)\-e s + {\N(T opt )\-l)-e r 

u£nl(T apt ) 

> (H(Tf) - |d+(T opt )| + 1) • e s + (\N(T opt )\ - 1) • e r 
= (H(2» + 1) • e s + (\N(T opt )\ - 1) • e r - \d + (T opt )\ ■ e s 

Since each node in d + (T opt ) must transmit at least once, 
we have |d+(T opt )| • e s < H(T opt , B opt ), and hence: (S(T/) + 
1) • e s + (|JV(Topt)| - 1) -e r < 2n(T opt ,S opt ). 

Let a = 3pH(A + 1) + p. With Lemma |5] Theorem [2] and 
Theorem [3] we have: 

|fl(u)| = ^ |F(«)| < |JV(5B)| 

u£d+(T) u£d+(T) 

< a\N(SB*)\ = oS(T/) 

Besides, we have some evident inequalities: e r < e s , 
|^(r)| J \M\ < \N(T opt )\, and |B(u)| > 1 (for any 
it € d + (T)). Using these inequalities, we can get: 

n(T,s) 

= ^ |B(«)|-e, + (|JV(5 5 )|-l).e r 

53 |S( U )|-e s + (| ( i+(r)| + |d 1 (T)|-l).e I . 

u£d+(T)u{s} 



e s +53 e s + (|M| - 1) • e r 

u£d+(T) 



< 2 J] |-B(«)| + 1 -e s + (|iV(T opt )|-l)-e r 

< (2a • H(2» + 1) • e s + (|AT(T opt )| - 1) • e r 

< 2a • ((H(2» + 1) • e s + (|iV(T opt )| - 1) • e r ) 

< 4a-U(T opt ,B opt ) 

= (12piJ(A + l)+4p)II(T opt ,£ opt ), 

hence the approximation ratio 12pH(A + 1) + 4p. □ 
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